QUESTION 1

A researcher wishes to study students’ satisfaction (strongly disagree=1, disagree=2, 
neutral=3, agree=4, strongly agree=5) towards the services provided by the Academic 
Affairs in Medical College X. The researcher chooses only 10 out of 50 classes. All the 
students from these 10 classes will be used for the study. 

a) Identify the population for the above study. 

(1 mark) 

b) State the sampling frame for the above study. 

(1 mark) 

c) Name the variable of interest for the above study. State its type and its level of 
measurement. 

(3 marks) 

d) State the sampling technique used in this study. Explain your answer in the context of 
the study. 

(2 marks) 


QUESTION 2 

The stem and leaf plot below represents the Mathematics test scores (out of 100) of 15 
randomly selected students in Class Angle. 


Stem | Leaf
   4 | 3
   5 | 2 4 8
   6 | 2 3 5 7 8 8 9
   7 | 1 5 8
   8 | 9

Key: 4 | 3 means 43


a) Calculate the mean and standard deviation of the test scores.


(4 marks) 


b) Interpret the mean obtained in a). 


(1 mark) 
c) The summary statistics of the test scores for Mathematics and Biology students in Class
Angle are given in the following table. Using an appropriate measure, determine
which subject's test scores are more dispersed.


Descriptive statistics

               N     Mean       Std. Deviation
Mathematics    15    65.4667    11.1859
Biology        17    66.9412    17.5623


(3 marks) 


QUESTION 3 

A diet trainer wants to determine whether there is a significant difference in the mean weight 
loss among four diet programs (low calorie diet, low fat diet, low carbohydrate diet and a 
control group). The data were collected and the result of the weight loss is shown below. 


ANOVA

                  Sum of Squares    df    Mean Square    F
Between Groups    A                 3     22.317         C
Within Groups     41.600            B     2.600
Total             108.550           19




a) Find the values of A, B and C. 


(3 marks) 


b) State the null and alternative hypotheses for this study. 


(2 marks) 


c) Using the F-test, is there sufficient evidence to indicate that there is a difference in mean
weight loss among the four diet programs? Use α = 0.05.

(3 marks) 


QUESTION 4 

A manager at Company Brilliant wishes to determine whether the employees’ work 
satisfaction is related to their respective department. The results obtained are shown below. 



                                Value    df    Asymp. Sig. (2-sided)
Pearson Chi-Square              D        2     0.541
Likelihood Ratio                2.522    3     0.471
Linear-by-Linear Association    1.314    1     0.252
N of Valid Cases                93
Satisfaction * Department Crosstabulation

                                                    Department
Satisfaction                       Finance    Sales    Human Resource    IT     Total
Satisfied        Count             12         38       5                 8      63
                 Expected Count    12.9       38.6     5.4               6.1    63.0
Not Satisfied    Count             7          19       3                 1      30
                 Expected Count    6.1        E        2.6               2.9    30.0
Total            Count             19         57       8                 9      93
                 Expected Count    19.0       57.0     8.0               9.0    93.0



a) Using an appropriate formula, compute the values of D and E. 


(5 marks) 


b) State the null and alternative hypotheses for this study.

(1 mark)

c) Based on the p-value, is there sufficient evidence to conclude that employees’ work
satisfaction is related to their respective department? Use α = 0.05.

(3 marks)


QUESTION 5 

A statistics lecturer intends to investigate whether there is sufficient evidence to conclude
that the average score was different from the expected average score of 74. A random
sample of 15 students was selected and analyzed using SPSS. The results obtained are as
follows.



              N     Mean       Std. Deviation    Std. Error Mean
Exam Score    15    74.4870    13.5564           3.5002


a) Identify the statistical test used for this study.

(1 mark)

b) Show that the standard error of the mean is 3.5002.

(2 marks)

c) Construct a 95% confidence interval for the average score.

(4 marks)

d) Based on the confidence interval in c), does the average score differ from 74? Give a
reason to support your answer.

(2 marks)


QUESTION 6 

One indicator of physical fitness is resting pulse rate. Ten men volunteered to test an
exercise device advertised on television by using it three times a week for 20 minutes.
Their pulse rates (beats per minute) were measured before and after six weeks of the test. The
data were recorded and analyzed using SPSS. The results are shown below.


Paired Samples Statistics

                    Mean       N     Std. Deviation    Std. Error Mean
Pair 1    Before    85.2000    10    6.89283           2.17970
          After     82.0000    10    5.94418           1.87972


                                     Paired Differences
                                                         95% Confidence Interval
                                                         of the Difference
                            Mean    Std. Deviation       Lower      Upper        t        df
Pair 1    Before - After    G       2.14994              1.66203    4.73797      4.707    9


a) Find the value of G. 

(1 mark) 

b) Prove the t-statistic value for the above study. 

(2 marks) 

c) State the 95% confidence interval for the mean difference before and after six weeks of 
the test. 

(1 mark) 

d) At the 5% level of significance, is there sufficient evidence to indicate that this kind of 
exercise device could reduce resting pulse rate? 

(4 marks) 
QUESTION 7

A study is conducted to determine the relationship between the length of advertisement
(in seconds) aired on TV and sales (in hundreds of RM) of ten selected companies. The
data were recorded and analyzed using SPSS. The results are shown below.


Length of advertisement    16    36    22    6     15    12    5     29    28    21
Sales                      48    69    71    14    15    29    20    39    65    30


Coefficients

                                 Unstandardized Coefficients    Standardized Coefficients
Model                            B        Std. Error            Beta                         t        Sig.
1    (Constant)                  8.017    10.521                                             .762     .468
     Length of advertisement     1.683    .494                  .769                         3.406    .009


a) Using the data given, calculate the correlation coefficient between the length of 
advertisement and sales. Explain the value obtained. 

(4 marks) 

b) Based on the output, write the least squares regression equation. Hence, interpret the
value of the slope.

(2 marks) 

c) Calculate the value of the coefficient of determination. Hence, interpret the value. 

(3 marks) 

d) Predict the sales of the company if the length of advertisement is 10 seconds. 

(2 marks) 


END OF QUESTION PAPER 
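SAMPLE MEASUREMENTS


Mean

$\bar{x} = \dfrac{\sum x}{n}$

Standard deviation

$s = \sqrt{\dfrac{1}{n-1}\sum (x - \bar{x})^2}$ or $s = \sqrt{\dfrac{1}{n-1}\left[\sum x^2 - \dfrac{(\sum x)^2}{n}\right]}$

Coefficient of Variation

$CV = \dfrac{s}{\bar{x}} \times 100$

Pearson’s Measure of Skewness

Coefficient of Skewness = $\dfrac{3(\text{mean} - \text{median})}{\text{standard deviation}}$ or $\dfrac{\text{mean} - \text{mode}}{\text{standard deviation}}$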
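
As a rough illustration of the sample measurements listed above, the following Python sketch computes the mean, the sample standard deviation (with the n - 1 divisor) and the coefficient of variation. The function name `sample_measurements` is hypothetical. The data are the 15 Mathematics scores read from the stem and leaf plot in Question 2, assuming the key 4 | 3 means 43.

```python
import math

def sample_measurements(values):
    """Return (mean, sample standard deviation, coefficient of variation in %)."""
    n = len(values)
    sum_x = sum(values)
    sum_x2 = sum(x * x for x in values)
    mean = sum_x / n
    # Shortcut form: s^2 = [sum(x^2) - (sum x)^2 / n] / (n - 1)
    variance = (sum_x2 - sum_x ** 2 / n) / (n - 1)
    sd = math.sqrt(variance)
    cv = sd / mean * 100
    return mean, sd, cv

# Scores read from the stem and leaf plot (assumed key: 4 | 3 means 43).
scores = [43, 52, 54, 58, 62, 63, 65, 67, 68, 68, 69, 71, 75, 78, 89]
mean, sd, cv = sample_measurements(scores)
print(round(mean, 4), round(sd, 4), round(cv, 2))
```

Under that reading of the plot, the mean and standard deviation agree with the Mathematics row of the descriptive statistics table in Question 2 c) (65.4667 and 11.1859).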
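CONFIDENCE INTERVAL


Parameter and description, followed by a $(1 - \alpha)100\%$ confidence interval

Mean $\mu$, for large samples:

$\bar{x} \pm z_{\alpha/2}\dfrac{\sigma}{\sqrt{n}}$ or $\bar{x} \pm z_{\alpha/2}\dfrac{s}{\sqrt{n}}$

Mean $\mu$, for small samples, variance $\sigma^2$ unknown:

$\bar{x} \pm t_{\alpha/2}\dfrac{s}{\sqrt{n}}$ ; $df = n - 1$

Difference in means of two normal distributions $\mu_1 - \mu_2$,
variances $\sigma_1^2 = \sigma_2^2$ and unknown:

$(\bar{x}_1 - \bar{x}_2) \pm t_{\alpha/2}\, s_p \sqrt{\dfrac{1}{n_1} + \dfrac{1}{n_2}}$ ; $df = n_1 + n_2 - 2$

$s_p = \sqrt{\dfrac{(n_1 - 1)s_1^2 + (n_2 - 1)s_2^2}{n_1 + n_2 - 2}}$

Difference in means of two normal distributions $\mu_1 - \mu_2$,
variances $\sigma_1^2 \neq \sigma_2^2$ and unknown:

$(\bar{x}_1 - \bar{x}_2) \pm t_{\alpha/2} \sqrt{\dfrac{s_1^2}{n_1} + \dfrac{s_2^2}{n_2}}$

Mean difference of two normal distributions for paired samples, $\mu_d$:

$\bar{d} \pm t_{\alpha/2}\dfrac{s_d}{\sqrt{n}}$ ; $df = n - 1$ where $n$ is no. of pairs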
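
The small-sample interval for a mean (variance unknown) can be sketched in Python as follows. This is only an illustration: the function name `t_interval` and the summary values are hypothetical, and the critical value is passed in by hand from a t table (2.131 for df = 15 at the 95% level) rather than computed.

```python
import math

def t_interval(mean, sd, n, t_crit):
    """Confidence interval: mean +/- t_crit * sd / sqrt(n), with df = n - 1."""
    standard_error = sd / math.sqrt(n)
    margin = t_crit * standard_error
    return mean - margin, mean + margin

# Hypothetical summary statistics, not taken from the question paper:
# n = 16, sample mean = 50, sample standard deviation = 8, t(0.025, 15) = 2.131.
lower, upper = t_interval(mean=50.0, sd=8.0, n=16, t_crit=2.131)
print(round(lower, 3), round(upper, 3))
```

Here the standard error is 8 / 4 = 2, so the margin is 2.131 × 2 = 4.262 on either side of the sample mean.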
HYPOTHESIS TESTING


ANALYSIS OF VARIANCE FOR A COMPLETELY RANDOMIZED DESIGN


Let:

$k$ = the number of different samples (or treatments)
$n_i$ = the size of sample $i$
$T_i$ = the sum of the values in sample $i$
$n$ = the number of values in all samples
    = $n_1 + n_2 + n_3 + \dots$
$\sum x$ = the sum of the values in all samples
    = $T_1 + T_2 + T_3 + \dots$
$\sum x^2$ = the sum of the squares of values in all samples

Degrees of freedom for the numerator = $k - 1$
Degrees of freedom for the denominator = $n - k$

Total sum of squares: $SST = \sum x^2 - \dfrac{(\sum x)^2}{n}$

Between-samples sum of squares: $SSB = \left(\dfrac{T_1^2}{n_1} + \dfrac{T_2^2}{n_2} + \dfrac{T_3^2}{n_3} + \dots\right) - \dfrac{(\sum x)^2}{n}$

Within-samples sum of squares: $SSW = SST - SSB$

Variance between samples: $MSB = \dfrac{SSB}{k - 1}$

Variance within samples: $MSW = \dfrac{SSW}{n - k}$

Test statistic for a one-way ANOVA test: $F = \dfrac{MSB}{MSW}$
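
The one-way ANOVA quantities above can be sketched in Python as follows. This is a minimal illustration, assuming the usual shortcut forms SST = Σx² - (Σx)²/n and SSB = ΣTᵢ²/nᵢ - (Σx)²/n. The function name `one_way_anova` and the three small samples are made up for the example and are not data from the question paper.

```python
def one_way_anova(samples):
    """Compute sums of squares, mean squares and F for a list of samples."""
    k = len(samples)                                  # number of samples (treatments)
    n = sum(len(s) for s in samples)                  # number of values in all samples
    sum_x = sum(sum(s) for s in samples)              # T1 + T2 + T3 + ...
    sum_x2 = sum(x * x for s in samples for x in s)   # sum of squared values
    correction = sum_x ** 2 / n                       # (sum x)^2 / n
    sst = sum_x2 - correction
    ssb = sum(sum(s) ** 2 / len(s) for s in samples) - correction
    ssw = sst - ssb
    msb = ssb / (k - 1)                               # df numerator = k - 1
    msw = ssw / (n - k)                               # df denominator = n - k
    return {"SST": sst, "SSB": ssb, "SSW": ssw,
            "MSB": msb, "MSW": msw, "F": msb / msw}

# Hypothetical data: three treatments with three observations each.
result = one_way_anova([[1, 2, 3], [2, 3, 4], [5, 6, 7]])
print(result)
```

For these made-up samples, SST = 32, SSB = 26 and SSW = 6, so MSB = 13, MSW = 1 and F = 13 with 2 and 6 degrees of freedom.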
SIMPLE LINEAR REGRESSION